首页> 外文OA文献 >ProLanGO: Protein Function Prediction Using Neural~Machine Translation Based on a Recurrent Neural Network
【2h】

ProLanGO: Protein Function Prediction Using Neural~Machine Translation Based on a Recurrent Neural Network

机译:proLanGO:基于神经〜机器翻译的蛋白质功能预测   基于递归神经网络

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

With the development of next generation sequencing techniques, it is fast andcheap to determine protein sequences but relatively slow and expensive toextract useful information from protein sequences because of limitations oftraditional biological experimental techniques. Protein function prediction hasbeen a long standing challenge to fill the gap between the huge amount ofprotein sequences and the known function. In this paper, we propose a novelmethod to convert the protein function problem into a language translationproblem by the new proposed protein sequence language "ProLan" to the proteinfunction language "GOLan", and build a neural machine translation model basedon recurrent neural networks to translate "ProLan" language to "GOLan"language. We blindly tested our method by attending the latest third CriticalAssessment of Function Annotation (CAFA 3) in 2016, and also evaluate theperformance of our methods on selected proteins whose function was releasedafter CAFA competition. The good performance on the training and testingdatasets demonstrates that our new proposed method is a promising direction forprotein function prediction. In summary, we first time propose a method whichconverts the protein function prediction problem to a language translationproblem and applies a neural machine translation model for protein functionprediction.
机译:随着下一代测序技术的发展,确定蛋白质序列是一种快速而廉价的方法,但是由于传统生物学实验技术的局限性,从蛋白质序列中提取有用的信息相对较慢且昂贵。蛋白质功能预测一直是填补巨大蛋白质序列与已知功能之间的空白的长期挑战。本文提出了一种新方法,将新提出的蛋白质序列语言“ ProLan”转换为蛋白质功能语言“ GOLan”,将蛋白质功能问题转化为语言翻译问题,并建立基于递归神经网络的神经机器翻译模型,将“ ProLan”语言改为“ GOLan”语言。我们通过参加2016年最新的第三次功能注释关键评估(CAFA 3)盲目测试了我们的方法,并且还评估了我们的方法对在CAFA竞争后释放了功能的所选蛋白质的性能。在训练和测试数据集上的良好性能表明,我们提出的新方法是蛋白质功能预测的有前途的方向。综上所述,我们首次提出了一种将蛋白质功能预测问题转换为语言翻译问题并应用神经机器翻译模型进行蛋白质功能预测的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号